Bayesian Hierarchical Models for Serial Analysis of Gene Expression

نویسندگان

  • Seungyoon Nam
  • Seungmook Lee
  • Sanghyuk Lee
  • Seokmin Shin
  • Taesung Park
چکیده

In the Serial Analysis of Gene Expression (SAGE) analysis, the statistical procedures have been performed after aggregation of observations from the various libraries for the same class. Most studies have not accounted for the within-class variability. The identification of the differentially expressed genes based on the class separation has not been easy because of heteroscedasticity of libraries. We propose a hierarchical Bayesian model that accounts for the within-class variability. The differential expression is measured by a distribution-free silhouette width which was first introduced into the SAGE differential expression analysis. It is shown that the silhouette width is more appropriate and is easier to compute than the error rate.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of Hierarchical Bayesian Models for Large Space Time Data of the Housing Prices in Tehran

Housing price data is correlated to their location in different neighborhoods and their correlation is type of spatial (location). The price of housing is varius in different months, so they also have a time correlation. Spatio-temporal models are used to analyze this type of the data. An important purpose of reviewing this type of the data is to fit a suitable model for the spatial-temporal an...

متن کامل

The Analysis of Bayesian Probit Regression of Binary and Polychotomous Response Data

The goal of this study is to introduce a statistical method regarding the analysis of specific latent data for regression analysis of the discrete data and to build a relation between a probit regression model (related to the discrete response) and normal linear regression model (related to the latent data of continuous response). This method provides precise inferences on binary and multinomia...

متن کامل

Bayesian Modeling of MPSS Data: Gene Expression Analysis of Bovine Salmonella Infection.

Massively Parallel Signature Sequencing (MPSS) is a high-throughput counting-based technology available for gene expression profiling. It produces output that is similar to Serial Analysis of Gene Expression (SAGE) and is ideal for building complex relational databases for gene expression. Our goal is to compare the in vivo global gene expression profiles of tissues infected with different stra...

متن کامل

Mixed membership analysis of genome-wide expression data

Learning latent “expression themes” that best express complex patterns in a sample is a central problem in data mining and scientific research. For example, in computational biology we seek a set of salient gene expression themes that explain a biological process, extracting them from a large pool of gene expression profiles. In this paper, we introduce probabilistic models to learn such latent...

متن کامل

Latent Aspects Analysis: From Independence to Contagion

Learning latent “expression themes” that best express complex patterns in a sample is a central problem in data mining and scientific research. For example, in computational biology we seek a set of salient gene expression themes that explain a biological process, extracting them from a large pool of gene expression profiles. In this paper, we introduce probabilistic models to learn such latent...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006